Fault-Tolerance of Parallel Volume Rendering on Cluster of PCs

نویسندگان

  • Sergio Guedes
  • Cristiana Bentes
  • Gabriel Pereira Da Silva
  • Ricardo C. Farias
چکیده

In this paper we address a very important issue in parallel rendering systems, reliability. Distributed systems, such as clusters of PCs, are low-cost alternatives for running parallel rendering systems. However, distributed systems are usually not reliable, machines can fail during the rendering process, resulting in incomplete final images. Therefore, our goal is to take advantage of specific features of the parallel rendering applications, like tile-based computation, to include mechanisms to dynamically detect machine failure and automatically process tasks retrieval, with low overhead and no extra hardware. We developed three different parallel rendering systems, all based on the Parallel ZSweep algorithm[5], to provide fault-tolerance in different ways. Our experimental results show that the three systems present a small overhead to detect the failures, and when a failure occurs, the redistribution of the work does not degrade the system performance. We conclude that it is possible to provide fault-tolerance at low-cost in a cluster of PCs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Volume Rendering for Ocean Visualization in a Cluster of PCs

Volume rendering techniques can be very useful in geographical information systems to provide meaningful and visual information about the surface and the interior of 3D datasets. For ocean visualization, in particular, volume rendering techniques improve the analysis of the ocean inner structure, by generating visual information about, e.g., its temperature, salinity, velocity and mass. The ren...

متن کامل

Image-Space-Parallel Direct Volume Rendering on a Cluster of PCs

An image-space-parallel, ray-casting-based direct volume rendering algorithm is developed for rendering of unstructured data grids on distributed-memory parallel architectures. For efficiency in screen workload calculations, a graph-partitioning-based tetrahedral cell clustering technique is used. The main contribution of the work is at the proposed model, which formulates the screen partitioni...

متن کامل

Providing Single I/O Space and Multiple Fault Tolerance in a Distributed RAID

Commodity EIDE disks provide low cost storage but are severely limited in bandwidth and cannot be made fault-tolerant. On the other hand, conventional RAID devices provide reliability and performance but worse price/performance figures. A cluster of PCs can be seen as a collection of networked low cost disks; such a collection can be operated by proper software so as to provide the abstraction ...

متن کامل

Porcupine: A Highly Available Cluster- based Mail Service

Clusters are a common vehicle for improving scalability and availability at a single service site in the network. Are network services the “Killer App” for clusters? • incremental scalability just wheel in another box... • excellent price/performance high-end PCs are commodities: high-volume, low margins • fault-tolerance “simply a matter of software” • high-speed cluster interconnects are on t...

متن کامل

Fault - Tolerant Clusters of Workstations with Single System Image

he computing trend is moving from clustering highend mainframes to clustering desktop computers. This trend is triggered by the widespread use of PCs, workstations, Gigabit networks, and middleware support for clustering. This paper presents new approaches to achieving fault tolerance and single system image (SSI) in a workstation cluster. A multicomputer cluster is a collection of node compute...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004